GMM Classifier for Identification of Neurological Disordered Voices Using MFCC Features
نویسندگان
چکیده
Automatic detection of neurological disordered subjects voice mostly relies on parameters extracted from time-domain processing. The calculation of these parameters often requires prior pitch period estimation; which in turn depends heavily on the robustness of pitch detection algorithm. In the present work cepstraldomain processing technique which does not require pitch estimation has been adopted to extract the features of voice signal. The Mel frequency cepstral coefficients (MFCCs) are computed using two methods; the fast Fourier transform (FFT) and the linear predictive coding (LPC) method. The cepstral parameters estimated from these methods are used as features to classify normal subject voice from neurologically disordered subject’s voice using Gaussian mixture model (GMM). The results of the two methods are compared, and it is found that the accuracy of LPC-MFCC based GMM classifier is 89.55% compared to FFT-MFCC based GMM classifier which is giving an accuracy of classification of 88.5%.
منابع مشابه
A two-stage approach using Gaussian mixture models and higher-order statistics for a classification of normal and pathological voices
A two-stage classifier is used to improve the classification performance between normal and pathological voices. A primary classification between normal and pathological voices is achieved by the Gaussian mixture model (GMM) log-likelihood scores. For samples that do not meet the thresholds for normal or disordered voice in the GMM, the final decision is made by a higher-order statistics (HOS)-...
متن کاملImproved Closed Set Text-Independent Speaker Identification by Combining MFCC with Evidence from Flipped Filter Banks
A state of the art Speaker Identification (SI) system requires a robust feature extraction unit followed by a speaker modeling scheme for generalized representation of these features. Over the years, Mel-Frequency Cepstral Coefficients (MFCC) modeled on the human auditory system has been used as a standard acoustic feature set for SI applications. However, due to the structure of its filter ban...
متن کاملMusic Instrument Identification Using MFCC: Erhu as an Example
In the analysis of musical acoustics, we usually use the power spectrum to describe the difference between timbres from two music instruments. However, according to our experiments, the power spectrum cannot be used as effective features for erhu instrument identification. In this paper, we use MFCC (mel-scale frequency cepstral coefficients) as features for music instrument identification usin...
متن کاملNative Language Identification Using Spectral and Source-Based Features
The task of native language (L1) identification from nonnative language (L2) can be thought of as the task of identifying the common traits that each group of L1 speakers maintains while speaking L2 irrespective of the dialect or region. Under the assumption that speakers are L1 proficient, non-native cues in terms of segmental and prosodic aspects are investigated in our work. In this paper, w...
متن کاملPerformance Analysis of Speaker Identification System Using GMM with VQ
Personal identity identification is an important requirement for controlling access to protected resources. Biometric identification by using certain features of a person is a more secured solution for security identification. Advances in speech processing technology and digital signal processors have made possible the design of high-performance and practical speaker recognition systems. A more...
متن کامل